Picture for Shukai Liu

Shukai Liu

MIRA: Mid-training Rubric Anchoring for Source-Aware Data Selection

Add code
May 28, 2026
Viaarxiv icon

Action-Aware Generative Sequence Modeling for Short Video Recommendation

Add code
Apr 28, 2026
Viaarxiv icon

Context as a Tool: Context Management for Long-Horizon SWE-Agents

Add code
Dec 26, 2025
Viaarxiv icon

UMRE: A Unified Monotonic Transformation for Ranking Ensemble in Recommender Systems

Add code
Aug 11, 2025
Viaarxiv icon

IFEvalCode: Controlled Code Generation

Add code
Jul 30, 2025
Viaarxiv icon

Comment Staytime Prediction with LLM-enhanced Comment Understanding

Add code
Apr 02, 2025
Figure 1 for Comment Staytime Prediction with LLM-enhanced Comment Understanding
Figure 2 for Comment Staytime Prediction with LLM-enhanced Comment Understanding
Figure 3 for Comment Staytime Prediction with LLM-enhanced Comment Understanding
Figure 4 for Comment Staytime Prediction with LLM-enhanced Comment Understanding
Viaarxiv icon

FullStack Bench: Evaluating LLMs as Full Stack Coders

Add code
Dec 03, 2024
Figure 1 for FullStack Bench: Evaluating LLMs as Full Stack Coders
Figure 2 for FullStack Bench: Evaluating LLMs as Full Stack Coders
Figure 3 for FullStack Bench: Evaluating LLMs as Full Stack Coders
Figure 4 for FullStack Bench: Evaluating LLMs as Full Stack Coders
Viaarxiv icon

MdEval: Massively Multilingual Code Debugging

Add code
Nov 04, 2024
Figure 1 for MdEval: Massively Multilingual Code Debugging
Figure 2 for MdEval: Massively Multilingual Code Debugging
Figure 3 for MdEval: Massively Multilingual Code Debugging
Figure 4 for MdEval: Massively Multilingual Code Debugging
Viaarxiv icon

M2rc-Eval: Massively Multilingual Repository-level Code Completion Evaluation

Add code
Oct 28, 2024
Figure 1 for M2rc-Eval: Massively Multilingual Repository-level Code Completion Evaluation
Figure 2 for M2rc-Eval: Massively Multilingual Repository-level Code Completion Evaluation
Figure 3 for M2rc-Eval: Massively Multilingual Repository-level Code Completion Evaluation
Figure 4 for M2rc-Eval: Massively Multilingual Repository-level Code Completion Evaluation
Viaarxiv icon

REALM: RAG-Driven Enhancement of Multimodal Electronic Health Records Analysis via Large Language Models

Add code
Feb 10, 2024
Viaarxiv icon